Xiaobao Wu

Epistemic Context Learning: Building Trust the Right Way in LLM-Based Multi-Agent Systems

Jan 29, 2026
Via arXiv

P2P: A Poison-to-Poison Remedy for Reliable Backdoor Defense in LLMs

Oct 06, 2025
Via arXiv

Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning

Jul 29, 2025
Via arXiv

Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning

Jun 10, 2025
Via arXiv

Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings

Jun 05, 2025
Via arXiv

SCOPE: Compress Mathematical Reasoning Steps for Efficient Automated Process Annotation

May 20, 2025
Via arXiv

Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models

May 05, 2025
Via arXiv

Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation

Apr 17, 2025
Via arXiv

HyperGraphRAG: Retrieval-Augmented Generation with Hypergraph-Structured Knowledge Representation

Mar 27, 2025
Via arXiv

Full-Step-DPO: Self-Supervised Preference Optimization with Step-wise Rewards for Mathematical Reasoning

Feb 20, 2025
Via arXiv